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(54) Method and apparatus for start code detection in a compressed bitstream 



(57) In coded representation of audio, video and 
system bitstreams, it is common to insert start codes to 
facilitate synchronization points. The start codes are 
usually unique patterns that cannot be duplicated within 
the bitstream. In the decoding process it is necessary to 
detect these start codes in order to begin the decoding 
of the bitstream in a synchronized way. In typical cases, 
it is normal for the start code to be byte aligned and 



have the first few bytes comprising of the same pattern. 
The present invention is a method and apparatus that 
exploit this feature to reduce the number of compari- 
sons required. A specific order is used in comparing the 
input bitstream to the start code pattern. Using this 
order the decode, only need to compare a subset of the 
bytes during the start code detection process. 
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Description 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0001] This invention is applicable to the field of digital 
video compression. It can be used to reduce the execu- 
tion time, computational requirements as well as mem- 
ory accesses required to search for the start code 
present in compressed bitstream. Start code search is 
common in the multiplexing and demultiplexing of multi- 
ple elementary video and audio stream. It is also used 
in the video and audio decoders to identify the start of 
various synchronization points in the elementary 
stream. In general this invention is useful when a search 
for a unique pattern within a bitstream is required. This 
invention is applicable when the start code exhibits the 
following characteristics of being byte aligned and 
begins with repeated bytes. 

2. Description of the Related Art 

[0002] In compressed digital audio and video data it is 
usually coded in segments representing a fixed frame of 
audio samples or parts of a picture. In each segment, 
the context of the current bit being decoded depends on 
the bits already decoded. Therefore the segment bit- 
stream must be decoded sequentially from beginning to 
end. It is common to insert a unique sequence of bits or 
bytes to separate the data or indicate the start of a new 
segment of the data. This unique sequence of bits or 
bytes are commonly referred to as start codes. Start 
codes are useful for random access into the bitstream 
so that the decoder can start at the next possible entry 
point. It is also useful when the decoder looses the con- 
text of the bitstream due to data error and needs to re- 
synchronize to the next segment of error free bitstream. 
In the case of the multiplexer that combines the audio 
and video elementary streams together, time stamps 
are tagged to start codes to identify the time to present 
the decoded data in order to achieve audio and video 
synchronization. Therefore it is desirable to be able to 
identify these start codes quickly. 
[0003] An example of start codes used can be found 
in compression standards such as MPEG-1 and MPEG- 
2. In the rest of this document the start codes of the 
MPEG-2 video is used as an example. It contains 23 
zeros followed by a 1 . The byte following this will then 
indicate the type of start code. All the start codes are 
byte aligned. This means that they will always start at 
the beginning of the byte boundary. This simplifies the 
search as the comparison can be done at the byte level 
and not at the bit level. 

[0004] For the rest of the document the 
sequence_header_code will be used as an 
example the bit pattern is 

"0000000000000000000000001 1011001 1 " in binary or 



"00 h 00 h 01 h B3 h " in hexadecimal. 
[0005] An example of the prior art is shown in Figs. 1 
and 2. It is common to scan through the bitstream com- 
paring each bit or byte against the unique start code. 

5 Fig. 1 shows a state diagram of a serial start code 
decoding process according to the prior art. In the serial 
start code decoding process, the bytes are sequentially 
compared to the start code pattern. The state machine 
changes state from state 1 to 4 depending on how many 

10 of the bytes have matched with the pattern. More specif- 
ically, Fig. 1 shows the state diagram for the search of a 
start code representing the sequence_header_code. 
The state machine begins in state 0 where everything is 
reset. It stays in this state until the next byte is 00 h when 

is it goes into state 1 . If the next byte is also 00 h then it 
goes to state 2. Otherwise it reverts to state 0. From 
state 2 the state machine goes into state 3 if the next 
byte is 01 h . If however the next byte is again 00 h the 
machines remains in state 2. Otherwise it again reverts 

20 to state 0. Finally from state 3 if the next byte is B3 h , 
then the state machine arrives at state 4 which is the 
final state where the start code has been found. 
[0006] Fig. 2 shows the logical flow chart equivalent of 
the state machine. The logic begins in module 1 where 

25 the next byte is shifted for comparison in module 2. If the 
byte is not equal to 00 h the flow returns to module 1 
where the subsequent byte is checked. If the byte is 00 h 
then the logic flows to module 3 where the next byte is 
shifted in and compared in module 4. If the byte is not 

30 equal to 00 h , then the flow returns to module 1 where 
the subsequent byte is checked. If the byte is 00 h the 
logic flows to module 5 where the next byte is shifted in 
and compared in module 6. If the byte is again 00 h the 
logic returns to module 5. If the byte is 01 h then the logic 

35 flows to module 7. Otherwise the logic flows back to 
module 1. In module 7 the next byte is shifted in and 
compared in module 8. If the byte is B3 h then the start 
code has been found. Otherwise the logic flows back to 
module 1 . 

40 [0007] Fig. 1 and 2 illustrates the prior art where each 
byte in the bitstream is scanned sequentially against the 
pattern of the start code. The start code is found when 
4 consecutive bytes in the bitstream matches the four 
consecutive bytes in the start code pattern. 

45 [0008] The problem to be solve is to reduce the 
number of comparison required thus reducing the 
number of memory accesses as well as the computa- 
tional requirements. As demonstrated in the example of 
the prior art in Figs. 1 and 2, each byte of the bitstream 

so has to be compared to a particular pattern. 

SUMMARY OF THE INVENTION 

[0009] In order to decrease the number of bytes com- 
55 pare during the start code search, this invention exploits 
the redundancy in the start code pattern. The following 
conditions are necessary for this invention to be useful. 
Firstly, the start code must be byte aligned thus implying 
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that the start code can be aligned to any byte boundary. 
Therefore the search and comparison must be done 
byte by byte. Secondly, the start code pattern must con- 
tain two or more consecutive bytes that are the same. 
[001 0] The example start code from MPEG-2 satisfies 
the conditions above. The start code is byte aligned and 
the first two bytes in the start code pattern are both 00 h - 
Therefore by comparing the second byte instead of the 
first byte we can make more than one conclusions when 
the byte is not 00 h . The first conclusion is that the start 
code cannot begin at the first byte since the second byte 
of the bitstream and the second byte of the start code 
pattern do not match. The second conclusion is that the 
start code cannot begin at the second byte since the 
second byte of the bitstream and the first byte of the 
start code also do not match. Therefore by making one 
comparison we can make two conclusions if the com- 
parison fails. This implies that only every second byte 
need to be checked. Thus halving the number of com- 
parison and memory access. 

[0011] According to a first aspect of the invention, a 
method of partitioning the coded representation of 
audio visual material into a plurality of access units 
whereby the beginning of each access unit is marked by 
a unique start code and that this start code is aligned to 
the byte boundaries, said method comprises: 

making a first part by a series of two or more identi- 
cal byte patterns; and 

making a second part by one or more bytes that 
together with the first part forms a unique bit pattern 
that cannot be duplicated by the rest of the valid bit- 
stream. 

[001 2] According to a second aspect of the invention, 
a method of start code detection in a bitstream compris- 
ing coded representation of audio visual material having 
the properties as in the first aspect, the method com- 
prises the steps of; 

aligning the bitstream to its byte boundary; 
examining the bitstream for the start code pattern 
by comparing the byte at the location correspond- 
ing to the last byte of the series of identical byte pat- 
terns of the start code; 

skipping all the bytes up to and including the said 
byte if the comparison shows that the byte is differ- 
ent from the byte pattern of the start code at that 
location; otherwise 

proceeding with byte by byte comparison of the 
remaining bytes of the start code pattern to deter- 
mine if there is a match. 

[001 3] According to a third aspect of the invention, an 
apparatus for start code detection in a bitstream com- 
prising coded representation of audio visual material 
having the properties as in the first aspect comprises: 



means for aligning the bitstream to its byte bound- 
ary; 

means for examining the bitstream for the start 
code pattern by comparing the byte at the location 

5 corresponding to the last byte of the series of iden- 
tical byte patterns of the start code; 
means for skipping all the bytes up to and including 
the said byte if the comparison shows that the byte 
is different from the byte pattern of the start code at 

10 that location; otherwise 

means for proceeding with byte by byte comparison 
of the remaining bytes of the start code pattern to 
determine if there is a match. 

15 [0014] According to a fourth aspect of the invention, in 
the method of the first aspect, the first part of the unique 
start code is made up of two consecutive bytes of zero 
bit. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 
[0015] 

Fig. 1 illustrates a state diagram of a serial start 
25 code decoding process according to the prior art. 

Fig. 2 illustrates a flow chart of the comparisons 
made with respect to the diagram of Fig. 1 , accord- 
ing to the prior art. 

Fig. 3 illustrates a state diagram of the according to 

30 the present invention. 

Fig. 4 illustrates a flow chart of the comparisons 
made with respect to the diagram of Fig. 3, accord- 
ing to the present invention. 
Fig. 5 illustrates an apparatus for the start code 

35 detection process according to the present inven- 
tion. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

40 

[0016] Referring to Fig. 3, a state diagram of the 
present invention is shown where the second byte is 
first compared to the pattern. This allows the compari- 
son to skip over the first byte without comparing it if the 

45 second byte do not match the pattern. More specifically, 
Fig. 3 shows the conclusion that can be drawn when the 
second byte in the bitstream is and is not 00 h . The state 
machine begins in state 0 where everything is reset. It 
stays in this state until the second byte is 00 h when it 

so goes into state 1 . If the first byte is also 00 h then it goes 
to state 3. Otherwise it goes to state 2. In state 2 if the 
next byte is 00 h the state machine goes to state 3. Oth- 
erwise it reverts to state 0 and everything is reset. From 
state 3 the state machine goes into state 4 if the next 

55 byte is 01 h . If however the next byte is again 00 h the 
machines remains in state 3. Otherwise it again reverts 
to state 0. Finally from state 4, rf the next byte is B3 h , 
then the state machine arrives at state 5 which is the 
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final state where the start code has been found. The 
main difference when compared to the state machine of 
the prior art is that the comparison starts in the second 
byte. This implies that when a comparison fails and 
goes back to state 0, two bytes are skipped instead of 5 
one. This is more clearly illustrated in the flow chart in 
Fig. 4. 

[0017] In Fig. 4. the operations begin by comparing 
the second byte of the bitstream in module 10. If the 
byte is equal to 00 h , the comparison proceeds to mod- 10 
ule 1 1 where the first byte is compared. Otherwise the 
operation is passed to module 12 where two bytes are 
skipped before the operation continues in module 10 
again. In module 1 1 , if the first byte is equal to 00 h then 
the processing is passed to module 13. Otherwise the 75 
processing is passed to module 14 where the next byte 
is shifted in for comparison in module 15. In module 15, 
if the byte is 00h the processing is again passed to mod- 
ule 13. Otherwise the comparison thus far has failed 
and two bytes are skipped in module 1 2 before the proc- 20 
ess begins again in module 10. In module 13, the next 
byte is shifted in for comparison in module 16. If the byte 
is equal to 01 h , then three of the four bytes have 
matched and the processing is passed to module 1 7. If 
however, the byte is not 01 hl there is a possibility that it 25 
may be 00 h . The processing in this case is passed to 
module 15 for comparison (without any bytes being 
shifted). In module 17, the next byte is shifted in for 
comparison in module 18. If the byte is B3 hl then the 
start code have been found. Otherwise, the comparison 30 
have again failed and two bytes are skipped in module 
12before the process begins again in module 10. 
[0018] Fig. 5 shows an apparatus for performing the 
start code detection of the current invention. The bit- 
stream, 51 , is passed into the byte aligned memory 35 
buffer module, 52, where it is byte aligned and buffered. 
The Byte Reordering and Selection Module, 53, then 
extracts the correct number of bytes from the buffer and 
rearrange the order of the bytes into the order of com- 
parison of the current invention. The Start Code Com- 40 
parator and Decision Module, 54, request for the 
desired byte through control line, 56, from the Byte 
Reordering and Selection Module. The comparison is 
made in the comparator module and the result is 
passed through line 59. Depending on the outcome of 45 
the comparison, the Shifter Module, 55, is controlled via 
line 57. The Shifter Module in turn controls the Byte 
Aligned Memory Buffer Module via; line 58, to shift the 
correct number of bytes in to and out of the buffer. 
[0019] The effect of the invention is that there is high so 
likelihood that more than one byte is skipped when the 
comparison fails. A simple estimation of the savings can 
be made by assuming that all 8 bit symbols are equally 
likely to occur. This gives the probability of the symbol 
XX h , p(XX h ), as 1 in 256. It can then be shown that the 55 
probability of being in state 0 in both the prior art and in 
the current invention is very close to 1 This means that 
in the prior art we are always shifting one byte and mak- 
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ing one comparison. However, in the current invention, 
we are most of the time shifting two bytes and only com- 
paring the second byte. Therefore the number of opera- 
tions required is also halved. 

[0020] This invention therefore effectively half the 
number of comparisons and memory accesses needed. 
This is particularly useful in a software environment 
where the processing resources saved can be utilized 
for other operations. 

Claims 

1. A method of partitioning the coded representation 
of audio visual material into a plurality of access 
units whereby the beginning of each access unit is 
marked by a unique start code and that this start 
code is aligned to the byte boundaries, said method 
comprising: 

making a first part by a series of two or more 
identical byte patterns; and 
making a second part by one or more bytes 
that together with the first part forms a unique 
bit pattern that cannot be duplicated by the rest 
of the valid bitstream. 

2. A method of start code detection in a bitstream 
comprising coded representation of audio visual 
material having the properties as in claim 1 , com- 
prising the steps of: 

aligning the bitstream to its byte boundary; 
examining the bitstream for the start code pat- 
tern by comparing the byte at the location cor- 
responding to the last byte of the series of 
identical byte patterns of the start code; 
skipping all the bytes up to and including the 
said byte if the comparison shows that the byte 
is different from the byte pattern of the start 
code at that location; otherwise 
proceeding with byte by byte comparison of the 
remaining bytes of the start code pattern to 
determine if there is a match. 

3. An apparatus for start code detection in a bitstream 
comprising coded representation of audio visual 
material having the properties as in claim 1 , com- 
prising: 

means for aligning the bitstream to its byte 
boundary; 

means for examining the bitstream for the start 
code pattern by comparing the byte at the loca- 
tion corresponding to the last byte of the series 
of identical byte patterns of the start code; 
means for skipping all the bytes up to and 
including the said byte if the comparison shows 
that the byte is different from the byte pattern of 
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the start code at that location; otherwise 
means for proceeding with byte by byte com- 
parison of the remaining bytes of the start code 
pattern to determine if there is a match. 

5 

4. A method according to claim 1, wherein the first 
part of the unique start code is made up of two con- 
secutive bytes of zero bit. 

10 



15 



20 



25 



30 



35 



40 



45 



SO 



55 



P O914009A2_l_> 



EP 0 914 009 A2 




6 

BNSDOCID: <EP 091 4009A2_I_> 



EP 0 914 009 A2 



Fig.2 PRIOR ART 

( Begin ) 



Next Byte 




EP 0 914 009 A2 




3::SDOCID: <EP 0914009A2J_> 



EP 0 914 009 A2 



Fig A 



12 



I 



Skip Two Bytes 



Next Byte 




YES 



( Begin ) 




YES 



Next Byte 




YES 



( Found ) 



■NSOOCID; <EP 031 4009 A2J_> 



9 



EP 0 914 009 A2 



Fig. 5 
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